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Abstract; [Aim] Aphididae insects are extremely polyphagous agricultural pests. The present study aims to 
[Methods] The 


complete mitogenome of Myzus persicae (Hemiptera; Aphididae) was sequenced by next-generation sequencing 


better understand the phylogenetic relationship within Aphididae by mitogenome analysis. 


(NGS) and PCR amplification, and sequence analysis in comparison with other Aphididae insects was made. 
The Bayesian and Maximum Likelihood phylogenetic analyses based on the 13 PCG sequences were performed. 
[ Results] The mitogenome of M. persicae (GenBank accession no. KU, 236024) is 17 832 bp in size and 
harbors an А + T content of 84.1% , an AT-skew of 0.094 and a GC-skew of — 0.296. It contains 13 protein- 
coding genes ( PCGs) (coxl —3, nadl -6, nad4L, аірб, atp8 and су), 22 tRNA genes, 


( rrnL and rrn$) and two long non-coding regions. 


two rRNA genes 
The gene order and orientation of the mitogenome of M. 
All protein-coding genes ( PCGs) start with an ATN initiation 
codon and terminate with a TAA stop codon, except for nad4 , which terminates with a single T nucleotide as an 


persicae are similar to those of other aphids. 
incomplete stop codon. In the mitogenome of M. persicae, there is a non-coding region (307 bp) between 
tRNA“ and tRNA", which includes variable numbers of tandem repeats in a lineage-specific manner. The 
mitogenome of M. persicae has the control region of 2 531 bp which is the longest in all the sequenced aphid 
mitogenomes. In the mitogenome of M. persicae, six domains with 44 helices are present in the secondary 
structure of rrnL and three domains with 24 helices are present in the secondary structure of rrnS. Phylogenetic 


analysis based on 13 PCGs from M. persicae and 20 other insect species showed that every superfamily in the tree 
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Guizhou 551700, China) 


forms a monophyletic clade. 
Macrosiphini had statistically high values. 


positioned most basally within the clade of Macrosiphini. 


M. persicae was clustered with D. noxia, 


[ Conclusion] Within the Aphidinae subfamily, the monophyly of Aphidini and 


and C. salicicola was 
The results of this study provide some valuable 


molecular data for reconstructing the phylogenetic relationship in Aphididae. 
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1 INTRODUCTION 


In general, insect mitochondrial genomes 
(mitogenomes ) are double-stranded DNA molecules 
that are 15 —20 kb in length; and contain 13 protein- 
coding genes ( PCGs), 22 transfer RNA (tRNA) 
genes, two ribosomal RNA (rRNA) genes, and a 
control region ( Boore, 1999; Cameron, 2014; Wang 
2015b ). Insect gene orders are mostly 
identical to the inferred ancestral arrangement, but 
some species show gene rearrangements ( Boore, 
1999; Dowton et al., 2002; Thao et al., 2004; Lee 
2009; Cameron, 2014). At present, 
than 113 complete hemipteran mitogenomes have 
been sequenced and annotated ( Wang et al., 2015b; 


Li et al., 2016). Mitogenomes are usually used in 


et al., 


et al., more 












































RNA; 


secondary structure; phylogeny 


systematic, and popu- 


2008; Li et al., 2012; Ma 


biogeographic, molecular, 


lation studies ( Hua et al., 
2012). 

Myzus persicae ( Sulzer )  ( Hemiptera; 
Aphididae) is an extremely polyphagous agricultural 
pest, 
including many major crops such as beans, potatoes, 


et al., 


feeding on more than 40 families of plants 


tobaccos, and so on, causing direct damage to plants 
and indirect damage by the поп-регѕіѕіепі 
transmission of important viruses ( Nikolakakis et al., 
2003). Aphididae contains more than 5 000 known 
species, and shows considerable genetic variation 
with regards to host-plant adaptation (Li et al., 
2012; Wang et al., 2013; Wang et al., 2015b). 
Until now, 11 mitogenomes of Aphididae have been 


sequenced (Table 1). In the present study, we 
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Table 1 The mitogenomes of insects used in this study 











Species Family Size (bp) GenBank accession no. References 
Myzus persicae * Aphididea 17 382 KU_236024 This study 
Acyrthosiphon pisum * Aphididea 16 971 NC. 011594 IAGC, 2010 
Aphis gossypii * Aphididea 15 869 NC_024581 Zhang et al., 2016 
Aphis glycines Aphididea 12 529 KC_840675 Wang et al., 2013 
Cavariella salicicola * Aphididae 16 317 NC_022682 Wang et al., 2013 
Cervaphis quercus * Aphididea 15 272 NC. 024926 Wang et al., 2014 
Diuraphis noxia * Aphididea 15 784 NC. 022727 Wang et al., 2014 
Hormaphis betulae Aphididea 15 088 NC. 029495 Li et al., 2015 
Pterocomma pilosum Aphididea 13 002 KC. 840676 Wang et al., 2013 
Schizaphis graminum * Aphididea 15 721 NC. 006158 Thao et al., 2004 
Sitobion avenae * Aphididea 15 180 NC. 024683 Zhang et al., 2014 
Mindarus keteleerfoliae Aphididae 18 199 KP. 722576 Wang et al., 2015a 
Aleurochiton. aceris * Aleyrodidea 15 388 NC. 006160 Thao et al., 2004 
Aleurodicus dugesii * Aleyrodidea 15 723 NC. 005939 Thao et al., 2004 
Bemisia afer * Aleyrodidea 14 968 NC. 024056 Wang et al., 2016 
Bemisia tabaci * Aleyrodidea 15 322 NC. 006279 Thao et al., 2004 
Neomaskellia andropogonis * Aleyrodidea 14 496 NC. 006159 Thao et al., 2004 
Tetraleurodes acaciae * Aleyrodidea 15 080 NC. 006292 Thao et al., 2004 
Trialeurodes vaporariorum * Aleyrodidea 18 414 NC. 006280 Thao et al., 2004 
Cacopsylla coccinea * Psyllidea 14 832 NC_027087 Que et al., 2015 
Pachypsylla venusta * Psyllidea 14 711 NC. 006157 Thao et al., 2004 
Paratrioza sinica * Psyllidea 14 863 NC. 024577 Zhang et al., 2014 
Abidama producta Cercopidae 15271 NC. 015799 Liu et al., 2014 
Aquarius paludum Gerroidea 15 380 NC. 012841 Hua et al., 2009 
Coptosoma bifaria Pentatomoidea 16 179 NC. 012449 Hua et al., 2008 
Nilaparvata. muiri Delphacidae 14 371 NC. 024627 Unpublished 


* Species used in phylogenetic analysis in this study. 





sequenced and annotated the complete mitogenome of 
M. persicae , which typifies the subfamily Macrosiphini. 


2 MATERIALS AND METHODS 


2.1 Experimental sample and genomic DNA 
extraction 

The specimens of M. persicae used in this study 
were obtained from a laboratory stock reared on 
tobacco in June 2015. The voucher specimens are 
deposited in Institute of Entomology, Guizhou 
University, Guiyang, China ( GZU-HO-000011 ). 
They were washed twice by vortexing in absolute 
alcohol and dried out at room temperature before 
DNA extraction. Total DNA was isolated using a 
DNeasy Tissue Kit ( QIAGEN, 


according to the manufacturer’ s protocols. The DNA 


Germany ), 


was sequenced by next-generation sequencing 
(NGS) (Illumina HisSeq 4000 and 2Gb raw data, 


Berry Genomic, Beijing, China ) and PCR 


amplification. 


2.2 Assembling sequence, PCR amplification, 
cloning , and sequencing 

The sequence data were assembled by Geneious 
R9 (Kearse et al., 2012). When necessary, 
species-specific primers ( Table 2) were designed 
based on known sequence by the program Primer 6.0 
(Premier Biosoft, Palo Alto, CA, USA). The 
fragments were amplified with Taq DNA polymerase 
(TaKaRa, Dalian, China) under the following 
conditions; 95°С for 3 min; 35 cycles of 95°C for 30 
s, 48 —55'C (depending on primer combination) for 
30 s and 70°С for 2 min (depending on putative 
length of the fragments) ; and 70°С for 10 тїп. АП 
the PCR fragments were cloned into the pMD19-T 
sequencing vector ( TaKaRa) and sequenced by 
Invitrogen Biotech ( Beijing, China) and Sangon 
Biotech ( Shanghai, China). Each fragment was 
assembled into a contig and then assembled into the 
complete mtDNA by SeqMan ( DNASTAR Ince., 
Madison, WI, USA). 
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Table 2 PCR primers used in this study 





Primers Primer sequence (5' - 37) Annealing temperature (C ) Fragment size (bp) 
MP-F1 AAAAGAAGCTGCTAACTATC 48 550 
МР-КІ GGTATTTGAAGTTGGTTGAA 

MP-F2 TTCAACCAACTTCAAATACC 48 500 
MP-R2 ATCATTATGTGGTTTTCCTT 

MP-F3 GAAGAAACAGGAGTAGGTGC 50 900 
MP-R3 ТАТАТСТТТТССТССТТСТТ 

MP-F4 GATAACCCCAACCATAAATT 52 800 
MP-R4 AAGTTTGATCTAATGGGTGA 

MP-F5 АТААТАСССТАТСТААТССТАСТТТ 48 з 000 
МР-В5 СААСТАСАААССТСАТСССТ 





2.3 Sequence analysis and gene annotation 
Thirteen PCGs, two ribosomal RNA genes and 
the control region of the mitogenome of M. persicae 
were identified by comparison with the mitogenome 
Aphididae 


nucleotide sequences of 13 PCGs were translated into 


sequences of other insects. The 
amino acid sequences on the basis of the invertebrate 
mitochondrial genetic code. The relative synonymous 
codon usage ( RSCU) values and A + T content of 
nucleotide sequence were calculated using MEGA 
6.0 (Tamura et al., 2013). 
skewness was calculated according to the formula: 
AT skew=; A- T]/[A* T], GC skew = [G- 
C]/[G-*C] (Perna and Kocher, 1995). The 
tRNA genes and secondary structure were identified 
using tRNA-SE v. 1. 21 (Schattner et al., 2005) and 
ARWEN (Laslett and Canbück, 2008) , and drawn 
using DNASIS v. 2. 5 ( Hitachi Engineering, Tokyo, 
Japan). The secondary structures of the two rRNA 
genes ( rrnL and rrnS) were predicted after the 


The composition 


Cavariella salicicola (Wang et al., 2013) and drawn 
by RNA structure ( Reuter and Mathews, 2010). 
2.4 Phylogenetic analysis 

Eighteen Aphididae , 
Aleyrodidae and Psyllidae species (marked with an 
asterisk in Table 1) downloaded from the GenBank 
and the mitogenome of M. persicae were used to 
phylogenetic 
Sternorrhyncha, and Nilaparvata muiri, Aquarius 


mitogenomes of 


analyze the relationship | among 
paludum , Coptosoma bifaria and Abidama producta 
were used as outgroups. The sequences of 13 PCGs 
were aligned by MEGA v. 6. 06 ( Tamura et al., 
2013) and the exported data were formatted by 
Clustal X ( Thompson et al., 1997) for phylogenetic 
analysis. Phylogenetic analysis was performed using 
Bayesian inference ( BI) analysis with MrBayes v. 
3.1.2 (Huelsenbeck and Ronquist, 2001) under 
the TVM + I + С model and Maximum Likelihood 


with MEGA v6. 0. The model was chosen with 
Modetest 3. 7 ( Nylander et al., 2004). BI analysis, 
two simultaneous runs of 1 000 000 generations were 
conducted for the matrix. The “ temperature ” 
parameter was set to a default value of 0. 2. Each 
one was sampled every 1 000 generations with a 
burn-in of 2596. ML tree was evaluated through 
MEGA 6.06 (Tamura et al., 2013) using the test of 
bootstrap method with 1 000 replications under 
Kimura 2-parameter model. 


3 RESULTS 


3.1 Genome structures and base composition 

The complete mtDNA of M. persicae ( GenBank 
accession no. КО 236024) is 17 832 bp in size 
(Fig. 1). The mitogenome contains 13 PCGs 
(сох1 -3, nadl —6, nad4L, atp6, atp8 and суф), 
22 tRNA genes, two rRNA genes ( rrnL and rmS). АП 
genes ( Table 3) are similar to those described in other 
hemipteran insects ( Wang et al., 2015b). 

The nucleotide composition of the whole 
mitogenome of M. persicae is as follows; A, 46.0% ; 
T, 38. 1%; С, 5.6%; and C, 10. 396. This is 
similar to that of other insects. The nucleotide 
composition, AT-skew, and GC-skew are shown in 
Table 4. 

3.2 Protein-coding genes 

The mitogenome of M. persicae includes 13 
PCGs, Which is similar to other Aphididae insects 
( Wang et al., 2013, 2015a; Zhang et al., 2014; 
Zhang et al., 2016). Nine of the 13 PCGs (nad2, 
сохі, cox2, atp8, atp6, cox3, nad3, nad6, and 
cytb) are coded on the majority strand ( J-strand) , 
four PCGs are coded on the minority strand ( N- 
strand) (Table 3 and Fig. 1). The start codon of 
the PCGs is a typical ATN, five (nad2, coxl, cox2, 
atp8, and nad4) starting with ATA, five ( арб, 
nad3 , nad5 , nad6, and nadl) with ATT, and the 


remainder with ATG. Alternative start codons have 
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Map of Ше mitogenome of Myzus persicae 


Graphic representation of AT ( green) and GC (blue) contents and their variation throughout the genome. 


been found in other insects, such as TTG in 
Hackeriella veitchi ( Lutz-Bonengel et al., 2004), 
GTG in Vargula hilgendorfii ( Ogoh and Ohmiya, 
2004) , ATAA in Drosophila melanogaster ( Ballard , 
2000) , and so on. All PCGs terminate with the stop 
codon TAA, except for nad4 which uses a single T 
nucleotide as an incomplete stop codon. А partial 
stop codon can usually occur in insects ( Wang 
et al., 2015b). 
3.3 Codon usage 

The RSCU values of the mitogenome of M. 
persicae are summarized in Table 5. The 13 PCGs in 
the mitogenome of M. persicae consist of 3 648 
codons in total. The codon GCG is absent in these 
PCGs. Five frequent codons are UUU (F), UUA 
(L), AUU (I), АПА (1), and ПАП (Y). These 
codons are all composed of A and U nucleotides, 
indicating a high usage of A and T in M. persicae 
PCGs. This is similar in other Aphididae insects 
( Wang et al., 2015a). 
3.4 Transfer and ribosomal RNA genes 

The mitogenome of М. persicae contains a 
typical set of 22 tRNAs, and all of them were 
predicted by tRNA-SE (Schattner et al., 2005) and 
ARWEN (Laslett and Canbáck, 2008). The tRNA 
genes range from 62 bp (tRNA™) to 73 bp 
(tRNA) in size. Fourteen tRNA genes аге coded 
on the J-strand and the others on the N-strand. The 


secondary structures of the M. persicae tRNAs are 
shown in Fig. 2. All tRNA genes are folded into a 
typical clover-leaf secondary structure, except 
tRNA*"( AGN) which lacks a DHU arm and this is a 
common feature in hemipteran insects. 

The ranges of rRNA genes were identified by 
alignment with other Aphididae species. The large 
ribosomal gene (rrnL) and small ribosomal gene 
(rrnS) of M. persicae are located between tRNA‘ 
(CUA) and tRNA“ and between tRNA and the A 
+ T-rich region (Fig. 1). The rrnL and rrnS genes 
are 1 260 and 812 bp in length, respectively , With 
an AT content of 84.9% ( Table 4). The secondary 
structures of rrnL and rrnS were predicted based on 
the secondary structure models from other insects 
( Gillespie et al., 2006; Wang et al., 2013; Ma et 
al., 2016). In the mitogenome of M. persicae, six 
domains with 44 helices are present in the secondary 
structure of rrnL (Fig. 3), which is very similar 
with that in C. salicicola, the first reported in 
Aphididae ( Wang et al., 2013). However, some 
differences exist between different aphid groups, for 
example, the secondary structure of rrnL of M. 
persicae has a bigger loop (22 bp) than C. salicicola 
(5 bp) in helix H2077. Three domains with 24 
helices are present in the secondary structure of rrnS 
in the mitogenome of M. persicae ( Fig. 4), and 
H673 is slightly different in C. salicicola. 
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Table3 The organization of the mitogenome of Myzus persicae 
Gene Direction Position Size (bp) Anti/start codon Stop codon Intergenic nucleotides 
tRNA". (0 J 1-65 65 GAT £ 
tRNAS™ (9) N 63 -128 66 TTG - 11 
tRNAMet M) J 140 - 206 67 CAT 一 0 
nad2 J 207 -1 184 978 ATA TAA -3 
ТЕМА"? OW) J 1 182 -1 249 68 TCA - -9 
tRNACY (©) N 1 241 - 1 308 68 GCA - 4 
tRNAT (0) N 1 313 - 1 378 66 TGT = 1 
сох1 1 1 380 -2 915 1 536 ATA TAA -5 
tRNALIe 71 (2 J 2 911 -2 978 68 TAA = 3 
cox2 J 2 982 -3 653 672 ATA TAA 2 
tRNAD ОЮ J 3 656 -3 728 73 CTT - 0 
ТАМА ^ (D) J 3 729 -3 791 63 GTC - 9 
aip8 J 3 801 -3 950 150 ATA TAA 1 
арб Ј 3 952 -4 584 633 АТТ ТАА -1 
cox3 J 4 584 -5 369 786 ATG TAA -1 
tRNACY (0) J 5 369 -5 433 65 TCC - 0 
nad3 J 5 434 —5 787 354 ATT TAA -1 
tRNA“ (А) J 5 787—5 850 64 TGC = -1 
tRNA 00 J 5 850 - 5 915 66 TCG 一 0 
ТАМА А" (N) Ј 5 916—5 981 66 СТТ 2 =) 
tRNASe C» J 5 980 -6 045 66 GCT - 2 
tRNACS™ Œ) J 6 048 -6 111 64 TTC = -6 
Repeat region - 6 106 -6 412 307 - - 
(ВМА? C) N 6 413 -6 477 65 GAA - 0 
nad5 N 6 478 - 8 217 1 740 ATT TAA 
tRNAH's (9) N 8 223 - 8 285 63 GTG = 0 
nad4 N 8 286—9 603 1318 ATA T -1 
nad4L N 9 603 -9 896 294 ATG TAA -2 
tRNAT CD J 9 895—9 956 62 TGT = 1 
tRNA?" P) N 9 958 - 10 023 66 TGG = 1 
nad6 J 10 025 - 10 519 495 ATT TAA -1 
cytb J 10 519 - 11 637 1 119 ATG TAA 4 
tRNASe -2 (9) J 11 642 - 11 706 65 TGA = 10 
nadl N 11 717 -12 652 936 ATT TAA 0 
(ВМА 1" 72 02 N 12 653 - 12 717 65 TAG - 0 
rrnL N 12 718 -13 977 1 260 3 = 0 
tRNA (У) N 13 978 – 14 039 62 TAC - 0 
rmS N 14 040 - 14 851 812 - - 0 
Control region 14 852 - 17 382 2 531 - - 0 
J: Majority strand ( J-strand) ; N; Minority strand ( N-strand) . 
Table 4 Nucleotide composition and skewness of the mitogenome of Myzus persicae 
Feature T (96) C (96) A (96) G (96) A+T (96) AT-skew GC-skew 
Whole genome 38.1 10.3 46.0 5.6 84.1 0.094 -0.296 
Protein-coding genes 47.6 8.9 35.2 8.3 82.8 -0.150 -0.035 
1st codon 37.0 9.9 47.3 6.1 84.3 0.122 -0.228 
2nd codon 39.0 10.8 44.3 5.7 83.3 0. 064 -0.305 
3rd codon 39.0 10.0 46.5 4.9 85.5 0.088 -0.352 
tRNA genes 40.1 8.4 45.5 6.0 85.6 0.063 -0.167 
rRNA genes 38.8 10.1 46.1 5.0 84.9 0.086 -0.338 
Control region 38.5 9.9 48.7 2.9 87.2 0.117 -0.547 
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Table 5 The number of codons and relative synonymous codon usage ( RSCU) in Myzus persicae 








Codon Count RSCU Codon Count RSCU Codon Count RSCU 
UUU(F) 428 1.71 UCU(S) 34 1.43 UAU(Y) 212 1.72 
UUC(F) 72 0.29 UCC(S) 2 0.08 UAC(Y) 34 0.28 
UUA(L) 380 3.83 UCA(S) 87 3.65 UAA(*) 12 0.36 
UUG(L) 49 0.49 UCG(S) 2 0.08 UAG( *) = - 
CUU(L) 76 0.77 CCU( P) 28 1.53 CAU(H) 53 1.61 
CUC(L) 12 0.12 CCC(P) 7 0.38 CAC(H) 13 0.39 
CUA(L) 60 0.61 CCA(P) 37 2.03 CAA(Q) 55 1.62 
CUG(L) 18 0.18 CCG(P) 1 0.05 CAG(Q) 13 0.38 
AUU(D) 370 1.69 ACU(T) 30 1.28 AAU(N) 176 1.77 
AUC(I) 40 0.18 ACC(T) 4 0.17 AAC(N) 23 0.23 
АПА (1) 247 1.13 АСА(Т) 59 2.51 ААА(К) 123 1.80 
AUG(M) 30 1.00 ACG(T) 1 0.04 AAG(K) 14 0.20 
GUU( V) 78 2.33 GCU(A) 18 1.38 GAU(D) 61 1.74 
GUC(V) 5 0.15 GCC( A) 6 0.46 GAC(D) 9 0.26 
GUA(V) 39 1.16 GCA(A) 28 2.15 GAA(E) 70 1.52 
GUG( V) 12 0.36 GCG(A) 0 0 GAG(E) 22 0.48 
UGU(C) 25 1.85 CGC(R) 2 0.20 AGA(R) 33 3.30 
UGC(C) 2 0.15 CGA(R) 14 1.40 AGG(R) 6 0.60 
UGA(*) - = CGG(R) 1 0.10 GGU(G) 23 1.35 
GGG(G) 8 0.47 








3.5 Non-coding regions 

The mitogenome of M. persicae contains 2 882 
bp of non-coding DNA (Table 3). The intergenic 
spacer between tRNA™ ( UCN) and nad1 is 10 bp in 
length, which is the same as those of Acyrthosiphon 
pisum, Diuraphis noxia, and Schizaphis graminum. 
This region corresponds to one of the binding sites of 
a mitochondrial transcription termination factor 
( Roberti et al., 2003), and possesses a 7 bp 
conserved motif ( Cameron and Whiting, 2008; 
Zhang et al., 2014). The corresponding motif is 
ATACTAA in M. persicae, А. pisum, and D. noxia. 
Generally, insect mitogenomes contain a control 
region and some short intergenic spacers, however, 


Phe 
° and 


there is a special repeat region between tRNA 
tRNA" in the mitogenome of M. persicae, as found 
Aphidinae except  Mindarus 


keteleerifoliae. 


in other insects 

The repeat region is located between tRNA“ 
and tRNA™. This region consists of multiple tandem 
repeat units, followed by a partial repeat unit. Both 
the length and copy number of the repeat units are 
different among aphid mitogenomes ( S. graminum , 
635 bp; D. noxia, 644 bp; and A. pisum, 1 513 
bp). The repeat region (307 bp) in M. persicae 
consists of two repeat units (79 bp for each) , with 
the first overlapping tRNA“" by 6 bp. 








The control region of the mitogenome of M. 
persicae is rich in A + T (87. 296) and located 
between rrnS and tRNA", the same as in other 
aphid species, it can be divided into four parts: 
repeat regions, А + T-rich zone, a conserved poly 
(T) stretch, and a stem-loop region ( Wang et al., 
2013, 2015a; Wang et al., 2014). The lengths of 
control regions in aphid mitogenomes are variable, 
for instance, the mitogenome of M. persicae (2 531 
bp) is currently the largest aphid mitogenomes ( from 
657 bp in Cervaphis quercus to 1 336 bp in A. 
pisum ). This 
attributed to their repeated regions. The first repeat 


difference in length is mainly 
is copied nine times, and every unit is 87 bp in 
length; the second repeat is copied eight times, and 
every unit is 131 bp in length. Furthermore, a 
microsatellite “ ( TA) ,;" was found ahead of the poly 
(Т) motif. 
3.6 Phylogenetic analysis 

In this study, the amino acid sequences of the 


13 PCGs 


phylogenetic relationships among the mitogenome of 


were concatenated to reconstruct 
M. persicae and those of 20 other insect species. 
These mitogenomes were found to represent three 
families within Hemiptera. The phylogenetic trees 
generated from ML and BI analyses showed the same 


topology (Fig. 5). The analyses supported that every 
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Fig. 2 Inferred secondary structures of 22 tRNA genes in the mitogenome of Myzus persicae 


The tRNAs are labeled with the abbreviations of their corresponding amino acids. The minus sign indicates Watson-Crick base pairing and a dot indicates 
G-U base pairing. 


superfamily in the tree forms a monophyletic clade. 
BI analyses showed that every superfamily in the tree 


Ferrari et al., 2006; von Dohlen et al., 2006; Wang 
et al., 2015a). 


f hyleti lade. Aphidid d 
od NU grob MP. cR 26 4 DISCUSSION AND CONCLUSION 
Aleyrodidae group into one clade and then 
demonstrate a stable sister relationship with The present paper reports the complete 


Psyllidae, and according to the taxonomy they all 
belong to Sternorrhyncha. The monophyly of 
Macrosiphini and Aphidini was recovered in the 
analyses and well supported. The mitogenomes of A. 
pisum and S. avenae are grouped with D. noxia, M. 
persicae and C. saliciola in the Macrosiphini and 
cluster with others. 
these 


Our phylogenetic analyses of 
were clearly segregated and 
demonstrated a similar topology with traditional and 
other molecular data (Blackman and Eastop, 1984; 


families 


mitogenome sequence of M. persicae (17 832 bp). The 
mitogenome of M. persicae has similar gene order and 
orientation to those of other aphids, but has the longest 
control region of all the sequenced aphid mitogenomes 
and its RNA structure is similar with that of other 
insects. In the mitogenome of M. persicae, a special 
repeat region (307 bp) exists between tRNA™ and 
tRNA“, as found in other Aphidinae insects except 
M. keteleerifoliae. Phylogenetic analysis based on 13 
PCGs showed that every superfamily in the tree forms a 
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Fig. 3 Predicted secondary structure of the rrnL gene in the mitogenome of Myzus persicae 
Roman numerals represent the conserved domain structures. Dashes indicate Watson-Crick base pairings and dots indicate G-U base pairing. 
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Fig. 4 Predicted secondary structure of the rrnS gene in the mitogenome of Myzus persicae 
Roman numerals represent the conserved domain structures. Dashes indicate Watson-Crick base pairings and dots indicate G-U base pairing. 
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Fig. 5 Phylogenetic tree inferred by Bayesian Inference (BI) and maximum likelihood (ML) methods based on the 


amino acid sequences of the 13 protein-coding genes ( PCGs) from Myzus persicae and 21 other insect species 


Bayesian posterior probabilities (front) and ML bootstrap support values ( back) are shown near the nodes. 


monophyletic clade. The mitogenome information 


inferred in this study may be limited, therefore, 
future research will involve in more taxon sampling to 
resolve this problem. 
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烟 蚜 线 粒 体 基 因 组 全 序列 及 系统 发 育 分 析 


HW Ж", EEE, RED, RAP, MRR 


(1. 贵州 大 学 烟草 学 院 , 贵阳 550025; 2， 贵 州 大 学 昆虫 研究 所 , 贵阳 550025; 
3. 贵州 省 烟草 公司 毕节 市 公司 , 贵州 毕节 551700) 











ME: (AM) 蚜虫 是 杂食 性 农业 害虫 。 本 研究 旨 在 通过 线粒体 基因 组 分 析 更 好 地 了 解 姬 科 昆 虫 的 
系统 发 育 关系 。【 方 法 ] 结合 第 二 代 测 序 和 PCR 扩 增 技术 获得 了 烟 蚜 Myzus persicae 线粒体 基因 组 
全 序列 ,与 蚜 科 其 他 昆 蛙 进行 了 对 比分 析 ; 以 贝 叶 斯 法 和 最 大 似 然 法 基于 13 个 蛋白 编码 基因 对 蚜 
科 进 行 了 系统 发 育 分 析 。[【 结果 】] 烟 蚜 线粒体 基因 组 (GenBank 登录 号 : KU. 236024) 序列 全 长 
17 832 bp, A + T 4 84. 196 , AT 4% & 7g 0.094,GC 偏 针 为 -0.296。 包 含 13 个 蛋白 编码 基因 
(сохі -3, nadl -6, nad4L, atp6, atp8 Fe cytb) ,22 个 tRNA,2 个 rRNA 基因 (rr 和 rmS) 和 2 个 长 
的 非 编码 区 ,其 基因 排列 顺序 与 已 知 的 蚜 科 昆虫 相似 ,除了 nad4 以 单独 的 了 结尾 ,所 有 的 蛋白 编码 
基因 均 以 ATN 作为 起 始 密码 子 ,TAA 作为 终止 密码 子 。 在 烟 蚜 线粒体 基因 组 中 ,tRNAS" 和 
tRNAP v [а] A — Ht 307 bp 的 非 编码 区 ,该 编码 区 包含 2 个 重复 单元 , 烟 蚜 的 控制 区 长 2 531 bp ,是 
所 有 测序 蚜 吕 线粒体 基因 组 中 最 长 的 。rr 取 的 二 级 结构 包含 6 个 结构 域 ,44 A XE ER 2838 MS 的 二 
级 结构 有 3 个 结构 域 ,24 个 茎 环 结构 。 基 于 烟 粉 息 和 其 他 20 种 昆 求 的 13 个 蛋白 编码 基因 重建 的 
BI 和 ML 系统 发 育 树 ,与 传统 形态 学 分 类 结果 一 致 。【 结论 ] 蚜 亚 科 和 长 管 蚜 亚 科 的 单 系 性 得 到 了 
很 好 的 支持 ;在 长 管 蚜 亚 科 的 分 支 中 ,Mpersicae 5 D. noxia 聚 成 一 支 , 并 且 С. salicicola 位 于 进化 
枝 的 底部 。 本 研究 结果 为 蚜 科 系统 发 生 关系 重建 积累 了 有 价值 的 数据 资料 。 
关键 词 : 蚜虫; 烟 蚜 ; 线粒体 基因 组 ; КМА; 二 级 结构 ; 系统 发 育 
中 图 分 类 号 : 0969 文献 标识 码 : A 文章 编号 : 0454-6296 (2017 )01-0084-11 
(责任 编辑 : 赵 利 辉 ) 


